Bandwidth Cost of Code Conversions in Distributed Storage: Fundamental Limits and Optimal Constructions

نویسندگان

چکیده

Erasure codes have become an integral part of distributed storage systems as a tool for providing data reliability and durability under the constant threat device failures. In such systems, [ n, k ] code over finite field F q encodes xmlns:xlink="http://www.w3.org/1999/xlink">k message symbols from into xmlns:xlink="http://www.w3.org/1999/xlink">n codeword which are then stored on different nodes in system. Recent work has shown that significant savings space can be obtained by tuning to variations failure rates. Such necessitates xmlns:xlink="http://www.w3.org/1999/xlink">code conversion : process converting already encoded initial xmlns:xlink="http://www.w3.org/1999/xlink">nI, kI its equivalent final xmlns:xlink="http://www.w3.org/1999/xlink">nF, kF code. The default approach conversion is re-encode new code, places burden system resources. xmlns:xlink="http://www.w3.org/1999/xlink">Convertible codes recently proposed class enabling resource-efficient conversions. Existing convertible focused minimizing access cost, i.e., number accessed during conversion. Bandwidth, corresponds amount read transferred, another important resource optimize this paper, we study fundamental limits bandwidth used present constructions bandwidth-optimal codes. First, model problem using network information flow graphs with variable capacity edges. Second, focusing MDS parameter regime called merge regime, derive tight lower bounds bandwidth. derived show significantly reduced compared even regions where it been cost cannot reduced. Third, construction matches bound thus

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Explicit Code Constructions for Distributed Storage Minimizing Repair Bandwidth

Regenerating codes are a class of recently developed codes for distributed storage, that permit data recovery from any k of n nodes, and also have the capability of repairing a failed node by connecting to any d nodes and downloading an amount of data, termed the repair bandwidth, that is on average, significantly less than the size of the data file. These codes optimally trade the storage spac...

متن کامل

Code Constructions for Distributed Storage With Low Repair Bandwidth and Low Repair Complexity

We present the construction of a family of erasure correcting codes for distributed storage systems that achieve low repair bandwidth and low repair complexity. The construction is based on two classes of codes, where the primary goal of the first class of codes is to achieve a good fault tolerance, while the second class of codes aims at reducing the repair bandwidth and the repair complexity....

متن کامل

Cost-Bandwidth Tradeoff In Distributed Storage Systems

Distributed storage systems are mainly justified due to the limited amount of storage capacity and improving the reliability through distributing data over multiple storage nodes. On the other hand, it may happen the data is stored in unreliable nodes, while it is desired the end user to have a reliable access to the stored data. So, in an event that a node is damaged, to prevent the system rel...

متن کامل

Optimal Subsystem Code Constructions

In this paper, a generic method to derive subsystem codes from existing subsystem codes is given that allows one to trade the dimensions of subsystem and co-subsystem while maintaining or improving the minimum distance. As a consequence, it is shown that all pure MDS subsystem codes are derived from MDS stabilizer codes. Furthermore, a simple construction of pure MDS subsystem codes is obtained...

متن کامل

MDS Code Constructions with Small Sub-packetization and Near-optimal Repair Bandwidth

A code C ⊆ F is a collection of M codewords where n elements (from the finite field F) in each of the codewords are referred to as code blocks. Assuming that F is a degree ` extension of a smaller field B, the code blocks are treated as `-length vectors over the base field B. Equivalently, the code is said to have the sub-packetization level `. This paper addresses the problem of constructing M...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Information Theory

سال: 2023

ISSN: ['0018-9448', '1557-9654']

DOI: https://doi.org/10.1109/tit.2023.3265512